Skip to content

feat(pipeline): include triple count in import result reporting#217

Open
ddeboer wants to merge 2 commits intomainfrom
feat/import-triple-count
Open

feat(pipeline): include triple count in import result reporting#217
ddeboer wants to merge 2 commits intomainfrom
feat/import-triple-count

Conversation

@ddeboer
Copy link
Member

@ddeboer ddeboer commented Mar 7, 2026

Summary

  • Add optional tripleCount property to ImportSuccessful so importers can report how many triples were indexed.
  • QLever importer appends cat <metadata>.json to the index command and parses num-triples.normal from the container logs using a regex (resilient to Docker log multiplexing binary frame headers). No host filesystem access needed.
  • Thread tripleCount through ResolvedDistributionProgressReporterConsoleReporter.
  • Console reporter displays the count using compact number formatting when present, e.g. ✔ Imported …/file.nt (4.8M triples, to http://…) in 22.8s.
  • The triple count is optional — importers that don't support it (e.g. Fuseki) simply omit it.

- Add optional tripleCount to ImportSuccessful (sparql-importer)
- Parse num-triples from QLever container logs instead of reading
  metadata file from host filesystem (volume-agnostic)
- Thread tripleCount through ResolvedDistribution and ProgressReporter
- Display formatted triple count in console reporter, e.g.
  '✔ Imported …/file.nt (4.8M triples, to http://…) in 22.8s'
@ddeboer ddeboer enabled auto-merge (squash) March 7, 2026 19:33
- Use regex instead of JSON.parse to extract num-triples from the
  metadata JSON that the index command cats to stdout
- Docker log multiplexing prepends binary frame headers that corrupt
  JSON parsing; regex is resilient to these
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant